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ABSTRACT 

This paper focused on the analysis of test items constructed in the paper of teaching Physical Science for B. Ed. class. It 
involved the analysis of difficulty level and discrimination power of each test item. Item analysis allows selecting or 
omitting items from the test, but more importantly item an a lysis is a tool to help the item writer improve an item. There 
were 100 objective type guestions in the achievement test and was administered to a sample of 100 sfudents from B.Ed. 
colleges. The result of the work showed that most of the test items fall in the acceptable range of difficulty index and 
discrimination index. However, 17 of fhe fest items was excluded due to very high or very low difficulty level and poor 
discrimination power. Purposive sampling was adopted for this study and 100 B. Ed. Physical Science students were taken 
from private B.Ed. Colleges in Pondicherry, India. A test of 100 items was used for data collection. Using the findings 
relationship between the difficulty index of each item and the corresponding discrimination index is carried out using the 
Pearson correlation formula. From the analysis, it was found that item discrimination power increased with the increase 
in difficulty value but got decreased for very easy and very difficult test items. This work is significant for the researcher in 
framing test items with optimum difficulty and discrimination power. The findings also show that, most of the items were 
falling in the acceptable range of difficulty and discrimination level, however, some items were rejected due to their 
poor discrimination index. 

Keywords: Test Item Analysis, Difficulty Level, Discrimination Index, Achievement Test, Test Items. 


INTRODUCTION 

Teaching and learning in classroom can be well assessed 
by achievement test and the two most important 
characteristics of an achievement test are its reliability and 
content validity. For a test to be reliable and valid, a 
systematic selection of items with regard to subject content 
and degree of difficulty is necessary. The effectiveness and 
quality of a test depends upon the individual item. And to 
determine the quality of individual item, item analysis is 
done after the administration and scoring of the 
preliminary draft of the test on the selected sample."ltem 
analysis indicates the difficulty level of each item and 
discriminate between the better and poorer examinees 
(Ebel, 1972). Item analysis has two purposes: First, to identify 
defective test items and secondly, to indicate the content 
the learners have or have not mastered Item analysis 


measures the effectiveness of individual test item in terms 
of its difficulty level and power to distinguish between high 
and low scorers in test (Brown & Frederick, 1971). Thus item 
analysis helps in selecting and retaining the best test items 
in the final draft of the test. Rejecting poor items in the test 
also show the need to review and modify the items. 
Therefore the main purpose of item analysis is to improve 
tests by revising or eliminating ineffective items. The 
procedures of item analysis of an achievement test in 
teaching of physical science in the B.Ed syllabus is 
described in this paper with a stress on the difficulty level 
and discrimination power of items in the test. 

Literature Review 

According to Backhoff, E., Larrazolo, N., & Rosas, M. (2000), 
the Basic Knowledge and Skills Examination (EXHCOBA) is 
one of the few great-scale examinations in Mexico which 
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has been publishing its psychometric parameters. In this 
paper they described the item analysis results, regarding 
the exam's difficulty level and discrimination power. It is 
viewed in the results that most of the items have a medium 
difficulty and a high discrimination power. It also revealed 
that, the mathematical items have better discrimination 
power levels than the ones which belong to social science. 

Suruchi & Surender Singh (2014) conducted a study on the 
analysis of test items constructed in the subject of Biology 
for class IX. It involved the analysis of difficulty level and 
discrimination power of each test item. There were 120 
objective type questions in the achievement test and was 
administered to a sample of 500 students from 
Government and Private Schools. The findings of the study 
showed that, most of the test items fall in the acceptable 
range of difficulty index and discrimination index. However, 
nine of the test items were rejected due to very high or very 
low difficulty level and poor discrimination power. It was 
found that, item discrimination power increased with the 
increase in difficulty value, but got decreased for very easy 
and very difficult test items. This work is significant for the 
researchers and school teachers in framing test items with 
optimum difficulty and discrimination power. 

Gowdhaman & Nachimuthu (2013) aimed to analyze the 
test items of an achievement in the subject of history 
(Indian) of standard 11 According to this study, objective 
tests are widely used to measure intelligence, aptitude, 
and achievement (or attainment). Item analysis allows 
selecting or omitting items from the test, but more to 
important, item analysis is a tool to help the item writer 
improve an item. Item statistics used in the item analysis of 
such criterion-referenced test is the Difference Index (Dl). 
The difference index shows the gain, or difference in 
performance, on each item between the pre-test and 
post-test. Purposive sampling was adopted for this study 
and 15 students were taken from Municipal Girl's Higher 
Secondary School, Gugai, Salem District in Tamil Nadu 
state, India. Atestof 190 items was used for data collection. 
Photographs (358) based teaching history of instruction, 
before and after the tests were conducted. Findings of the 
study were a total 190 items of achievement test for the 
difference index like 80 items are much better related to 


the Curriculum, 25 items are higher values and 85 items are 
not fitting; because they reflect only small gains. So these 
items were rejected. Finally, 105 items were selected form 
of History achievement test. 

Kevin D. Crehan (1974) have developed an achievement 
test in which various item selection techniques are 
compared on consequential criterion referenced reliability 
and validity. Techniques compared include three nominal 
criterion-referenced methods, a traditional point biserial 
selection, teacher selection, and random selection. 
Eighteen volunteer junior and senior high school teachers 
supplied behavioral objectives and item pools ranging 
from 26 to 40 items. Each teacher obtained responses 
from four classes. Pairs of tests of various dimensions were 
developed by each item selection method. Evaluation of 
test reliability and validity were obtained using responses 
independent of the test construction sample. Resultant 
reliability and validity estimates were compared across 
item selection techniques. The findings of the study 
revealed that two of the criterion-referenced item selection 
methods resulted in consistently higher observed validity. 
However, the small magnitude of improvement over 
teacher or random selection raises a question as to 
whether the benefit warrants the necessary extra effort on 
the part of the classroom teacher. 

Michael (1974); Muller, Calhoun, and Orling (1972) 
concluded that test reliability is dependent on the type of 
answer document used by elementary pupils. The present 
study was designed in part to assess the differential effect 
of two pupil response procedures (answering directly in the 
test booklet versus on a separate answer folder) on 
Metropolitan Achievement Tests scores of grades 3 and 4 
pupils. Over 4000 pupils from nine school systems took the 
Metropolitan, half responding in their booklets and half 
using answer folders. The two groups were matched by 
grade in general scholastic aptitude. Although the 
separate answer folder group received lower scores than 
those did the group responding in the test booklets, the 
score reliabilities did not differ significantly for any test, 
Additionally, these reliabilities did not differ significantly from 
comparable metropolitan normative reliabilities. For survey 
achievement tests such as metropolitan, test reliability 
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would not appear to depend on pupil response mode. 
Boopathiraj & Chellamani, K. (2013) have conducted a 
study to analyze the test items of a researcher that made 
test in the subject of Research in Education for the student- 
teachers of Master of Education (M.Ed.). It involves the item 
difficulty and item discrimination. A test of multiple choice 
items was used as a data collection instrument in different 
Colleges of Education to 200 student teachers taken 
randomly. The sample for this study consisted of both 
gender. The findings show that most of the items were 
falling in acceptable range of difficulty and discrimination 
level however some items were rejected due to their to 
poor discrimination index. 

Objectives 

1. To find out the item difficulty level and discrimination 
power of individual test items. 

2, To find out the relationship between degree of item 
difficulty and corresponding power of discrimination of 
test items. 

Research Questions 

The following questions emerged for the present study 

1. Are there differences in the item difficulty level and 
discrimination power of individual test items? 

2. Are there differences in the relationship between 
degree of item difficulty and corresponding power of 
discrimination of test items? 

Methodology 

Sample 

Purposive sampling method was adopted to draw a 
sample size of 100 B.Ed. students of Physical Science 
branch. 

Instrument Used 

An Achievement test of 100 objective test items was used 
for data collection. The test is developed with the help of 
subject experts on the selected three chapters in teaching 
of Physical Science course in B.Ed. Blueprint of the test was 
prepared giving due weightage to the instructional 
objectives, subject content and forms of questions. Bloom's 
revised taxonomy was used to frame the test items. Test 
comprised of twenty fill in the blanks, twenty true or false, 


twenty answer in one word questions and forty multiple 
choice questions. While constructing the items it was 
ensured that, language of test items was simple and 
unambiguous providing no clues towards the correct 
answer. Instructions were made clear on the test and 
answers were written in the question paper. 

Data Collection 

The test was conducted on a sample of 100 students from 
B.Ed. Colleges. Students were given as much time as they 
required to complete the test. Students were instructed to fill 
their answers only in the question paper provided. 

Statistical Analysis 

After scoring the test items, test scores were arranged in 
descending order. To carry out the item analysis top 27% 
scorers and bottom 27% scorers of the total examinees 
were selected. Upper and lower 27% was used as "this 
value maximize the difference in normal distributions while 
providing enough cases for analysis" (Wiersma & Jurs, 
1990). Middle 46% of the test scores were excluded as they 
behave in a similar pattern contributing insignificantly to 
discriminate the performance by students. 

Item Difficulty 

The difficulty of an item is understood as the proportion of 
the persons who answer a test item correctly. The higher this 
proportion, the lower the difficulty. What this means is that, it 
has to do with an inverse relationship: the greater the 
difficulty of an item, the lower its index (Wood, 1960), 

Item difficulty is the percentage of students correctly 
answered the item, also referred to as the p-value. The 
range is from 0% to 100%, the higher the value, the easier 
the item. P values above 0.80 are very easy items and 
might be a concept not worth testing. P-values below 0.20 
indicate difficult items and should be reviewed for possible 
confusing language or the contents needs revision. 

The following formula is used to find difficulty level. 

DL=R U +R/N U +N, 

where, 

R u = The number students in the upper group who 
responded correctly 

R, = The number students in the lower group who 
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Difficulty value 

Quality 

Recommendation 

Discriminative Index 

Item Evaluation 




0.40 and above 

Very good item 

Below 0.20 

Very difficult 

misleading Discard 

0.30 to 0.39 

Good but possibly subject 

0.20 to 0.50 

Good 

Retain 

0.20 to 0.29 

Marginal items subjected 

0.50 to 0.80 

Best 

Retain 


to improvement 

Above 0.80 

Very Easy 

Poor item Discard 

0.19 and below 

Poor items to be rejected 





or improved by revision 

Table 1, 

Difficulty Level and Item Evaluation 

Table 2. Discriminative Indices and Item Evaluation 


responded correctly 

N u = Number of students in the upper group 
N,= Number of students in the lower group 

Higher the values of the difficulty index, easier the item is 
(Brown, 1983), If most students answered an item correctly, 
then the item was an easy one. If most students answered 
an item incorrectly, then it should have been a difficult one. 
So, the items answered correctly by 100% or 0% of the 
examinees are insignificant. Table 1 shows the difficulty 
level and item ecaluation 
Item Discrimination 

Item discrimination defines the index of indiscrimination as 
the ability of an item on the basis of which the 
discrimination is made between superiors and inferiors. 
(Blood and Budd, 1972). Item discrimination or the 
discriminating power of a test item refers to the degree to 
which success or failure of an item indicates control of the 
ability being measured. It determines the extent to which 
the given item discriminates among examinees in the 
function or ability measured by the item. This value ranges 
between 0.0 and 1.00. Higher the value, more 
discrimination of the item. A highly discriminating item 
indicates that, the students who had high tests scores got 
the item correct whereas students who had low test scores 
got the item incorrect. 

Discrimination power is estimated using the following 
formula: 

Discrimination power = Ru-R/N^orJNL 

R u = The number students in the upper group who 

responded correctly. 

R, = The number students in the lower group who 
responded correctly. 

N u = Number of students in the upper group. 

N,= Number of students in the lower group. 


Tests with high internal consistency consist of items with 
most positive relationships with total test score. In practice, 
values of the discrimination index will seldom exceed 0.50 
because of the differing shapes of an item and total score 
distributions. ScorePak® classifies item discrimination as 
"good" if the index is above 0.30; "Fair" if it is between 0.10 
and 0.30; and "poor" if it is below 0.10. Table 2 shows the 
Discriminative Indices and Item Evaluation. 

Findings 

The findings of item analysis on 100 test items can be 
understood by the Table 3. 

The test items provided for achievement test in teaching of 
Physical Science is hundred. The item difficulty index and 
item discrimination was calculated using respective 
formulas and values are tabulated. Based on the value, 


Item 

No. 

Mean 

SD 

Difficulty 

Difficulty 

level 

Discrimination 

power 

No. of 
Students 

1 

0.42 

0.496 

42 

Medium 

0.407 

0.444 

Good 

100 

2 

0.62 

0.488 

62 

Medium 

0.611 

0.259 

Fair 

100 

3 

0.58 

0.496 

58 

Medium 

0.537 

0.333 

Good 

100 

4 

0.42 

0.496 

42 

Medium 

0.389 

0.407 

Good 

100 

5 

0.29 

0.456 

29 

Hard 

0.259 

0.519 

Good 

100 

6 

0.39 

0.490 

39 

Hard 

0.352 

0.407 

Good 

100 

7 

0.49 

0.502 

49 

Medium 

0.444 

0.741 

Good 

100 

8 

0.68 

0.469 

68 

Medium 

0.574 

0.778 

Good 

100 

9 

0.49 

0.502 

49 

Medium 

0.481 

0.889 

Good 

100 

10 

0.53 

0.502 

53 

Mediu 

0.519 

0.296 

Fair 

100 

11 

0.44 

0.499 

44 

Medium 

0.407 

0.519 

Good 

100 

12 

0.67 

0.473 

67 

Medium 

0.648 

0.333 

Fair 

100 

13 

0.74 

0.441 

74 

Easy 

0.630 

0.370 

Fair 

100 

14 

0.46 

0.501 

46 

Medium 

0.481 

0.815 

Fair 

100 

15 

0.42 

0.496 

42 

Medium 

0.444 

0.593 

Fair 

100 

16 

0.62 

0.488 

62 

Medium 

0.500 

0.556 

Fair 

100 

17 

0.59 

0.494 

59 

Medium 

0.556 

0.370 

Fair 

100 

18 

0.62 

0.488 

62 

Medium 

0.500 

0.481 

Fair 

100 

19 

0.48 

0.502 

48 

Medium 

0.481 

0.593 

Fair 

100 

20 

0.45 

0.500 

45 

Medium 

0.500 

0.556 

Fair 

100 

21 

0.76 

0.429 

76 

Medium 

0.778 

0.000 

Poor 

100 

22 

0.62 

0.488 

62 

Medium 

0.648 

0.185 

Fair 

100 

23 

0.83 

0.378 

83 

Medium 

0.815 

0.074 

Poor 

100 
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24 

0.66 

0.476 

66 

Medium 

0.704 

0.074 

Poor 

100 

25 

0.79 

0.409 

79 

Medium 

0.796 

0.110 

Poor 

100 

26 

0.63 

0.485 

63 

Medium 

0.667 

-0.150 

Poor 

100 

27 

0.61 

0.984 

61 

Medium 

0.519 

0.074 

Poor 

100 

28 

0.71 

0.456 

71 

Medium 

0.722 

0.333 

Good 

100 

29 

0.67 

0.473 

67 

Medium 

0.648 

0.037 

Poor 

100 

30 

0.60 

0.492 

60 

Medium 

0.556 

0.296 

Fair 

100 

31 

0.83 

0.378 

83 

Medium 

0.778 

0.148 

Fair 

100 

32 

0.40 

0.492 

40 

Hard 

0.426 

-0.040 

Poor 

100 

33 

0.75 

0.435 

75 

Medium 

0.685 

0.185 

Fair 

100 

34 

0.53 

0.502 

53 

Medium 

0.500 

0.185 

Fair 

100 

35 

0.52 

0.502 

52 

Medium 

0.481 

0.000 

Poor 

100 

36 

0.41 

0.494 

41 

Medium 

0.370 

0.074 

Poor 

100 

37 

0.57 

0.498 

57 

Medium 

0.574 

0.037 

Poor 

100 

38 

0.72 

0.451 

72 

Medium 

0.741 

-0.07 

Poor 

100 

39 

0.61 

0.490 

61 

Medium 

0.630 

0.074 

Poor 

100 

40 

0.79 

0.409 

79 

Medium 

0.796 

0.111 

Fair 

100 

41 

0.53 

0.502 

53 

Medium 

0.519 

0.815 

Good 

100 

42 

0.58 

0.496 

58 

Medium 

0.593 

0.444 

Good 

100 

43 

0.43 

0.498 

43 

Medium 

0.500 

0.407 

Good 

100 

44 

0.38 

0.488 

38 

Hard 

0.352 

0.556 

Good 

100 

45 

0.59 

0.494 

59 

Medium 

0.537 

0.704 

Good 

100 

46 

0.42 

0.496 

42 

Medium 

0.481 

0.741 

Good 

100 

47 

0.51 

0.502 

51 

Medium 

0.556 

0.667 

Good 

100 

48 

0.52 

0.502 

52 

Medium 

0.519 

0.593 

Good 

100 

49 

0.46 

0.501 

46 

Medium 

0.463 

0.556 

Good 

100 

50 

0.43 

0.498 

43 

Medium 

0.463 

0.630 

Good 

100 

51 

0.44 

0.499 

44 

Medium 

0.537 

0.556 

Good 

100 

52 

0.31 

0.465 

31 

Hard 

0.333 

0.519 

Good 

100 

53 

0.63 

0.485 

63 

Medium 

0.519 

0.741 

Good 

100 

54 

0.56 

0.499 

56 

Medium 

0.556 

0.815 

Good 

100 

55 

0.46 

0.501 

46 

Medium 

0.481 

0.444 

Good 

100 

56 

0.49 

0.502 

49 

Medium 

0.537 

0.704 

Good 

100 

57 

0.52 

0.502 

52 

Medium 

0.426 

0.407 

Good 

100 

58 

0.26 

0.441 

26 

Hard 

0.370 

0.593 

Good 

100 

59 

0.43 

0.498 

43 

Medium 

0.444 

0.370 

Good 

100 

60 

0.54 

0.501 

54 

Medium 

0.519 

0.667 

Good 

100 

61 

0.58 

0.496 

58 

Medium 

0.556 

0.222 

Fair 

100 

62 

0.58 

0.516 

58 

Medium 

0.574 

0.407 

Good 

100 

63 

0.48 

0.502 

48 

Medium 

0.307 

0.222 

Fair 

100 

64 

0.73 

0.446 

73 

Easy 

0.667 

0.000 

Poor 

100 

65 

0.55 

0.500 

55 

Medium 

0.611 

0.259 

Fair 

100 

66 

0.57 

0.498 

57 

Medium 

0.593 

0.296 

Fair 

100 

67 

0.60 

0.492 

60 

Medium 

0.611 

0.037 

Poor 

100 

68 

0.68 

0.469 

68 

Medium 

0.648 

0.185 

Fair 

100 

69 

0.54 

0.501 

54 

Medium 

0.593 

0.148 

Fair 

100 

70 

0.53 

0.502 

53 

Medium 

0.593 

0.000 

Poor 

100 

71 

0.47 

0.502 

47 

Medium 

0.463 

0.259 

Fair 

100 

72 

0.43 

0.498 

43 

Medium 

0.444 

0.444 

Good 

100 

73 

0.28 

0.451 

28 

Hard 

0.278 

0.333 

Good 

100 

74 

0.46 

0.501 

46 

Medium 

0.463 

0.407 

Good 

100 

75 

0.71 

0.456 

71 

Medium 

0.648 

0.111 

Fair 

100 

76 

0.55 

0.500 

55 

Medium 

0.537 

0.259 

Fair 

100 

77 

0.25 

0.435 

25 

Hard 

0.315 

0.185 

Fair 

100 

78 

0.50 

0.503 

50 

Medium 

0.574 

0.259 

Fair 

100 

79 

0.69 

0.465 

69 

Medium 

0.685 

0.185 

Fair 

100 

80 

0.55 

0.500 

55 

Medium 

0.537 

0.259 

Fair 

100 

81 

0.52 

0.502 

52 

Medium 

0.537 

0.259 

Fair 

100 

82 

0.36 

0.482 

36 

Hard 

0.333 

0.370 

Good 

100 

83 

0.41 

0.494 

41 

Medium 

0.426 

0.111 

Fair 

100 

84 

0.48 

0.502 

48 

Medium 

0.556 

0.296 

Fair 

100 

85 

0.43 

0.498 

43 

Medium 

0.426 

0.407 

Good 

100 

86 

0.41 

0.494 

41 

Medium 

0.407 

0.296 

Fair 

100 

87 

0.62 

0.488 

62 

Medium 

0.63 

0.148 

Fair 

100 

88 

0.48 

0.502 

48 

Medium 

0.500 

0.481 

Good 

100 

89 

0.42 

0.496 

42 

Medium 

0.463 

0.481 

Good 

100 

90 

0.57 

0.498 

57 

Medium 

0.648 

0.333 

Good 

100 

91 

0.40 

0.492 

40 

Hard 

0.389 

0.481 

Good 

100 

92 

0.42 

0.496 

42 

Medium 

0.481 

0.667 

Good 

100 

93 

0.37 

0.485 

37 

Hard 

0.426 

0.111 

Fair 

100 

94 

0.61 

0.490 

61 

Medium 

0.611 

0.259 

Fair 

100 

95 

0.47 

0.502 

47 

Medium 

0.537 

0.333 

Good 

100 

96 

0.54 

0.501 

54 

Medium 

0.574 

0.481 

Good 

100 

97 

0.60 

0.492 

60 

Medium 

0.593 

0.444 

Good 

100 

98 

0.50 

0.503 

50 

Medium 

0.574 

0.407 

Good 

100 


99 0.52 0.502 52 Medium 0.481 0.074 Poor 100 

100 0.58 0.496 58 Medium 0.574 0.333 Good 100 

Data source: Authors' B.Ed. Student's Achievement Test 

Table 3. Item Analysis 

each item is described as Poor, Fair, Good, Hard, Medium 
and Easy. 

Discussion 

The discussion is made based on the research question 
framed for this study. This study aimed to study the 
differences in the item difficulty level and discrimination 
power of the individual test item. The findings of the study 
reveal that seventeen items out of 100 items were rejected 
either due to difficulty level or discrimination Index. Thirty 
seven items fall in the range of 0.1 to 0.3 and they are 
named as fair items. Remaining forty six items are good 
items as they have the discrimination value above 0.3. 
Therefore Eighty three items were accepted provided with 
necessary revision. Table 4 shows the rejected item (Poor 
item <0.1). 


Difficulty Hard (0-50) Medium (50-85) Easy (85-100) Total 


Poor(<0.1) 

1 

15 

1 

17 

Discrimination Fair (0.1-0.3) 

2 

34 

1 

37 

Good(<0.3) 

8 

38 

0 

46 

Total 

11 

87 

2 

100 


Table 4. Relationship between Item Difficulty 
and Item Discrimination 



Figure 1. Item Frequency for Each Difficulty Index Range. 
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Item Difficulty and Frequency Graph 

Figure 1, shows the graph of difficulty indices for the 100 
items frequency, no items fall below 0.2 and only three item 
falls above 0.80 difficulty level. Atotal of 97 items are in the 
range of 0.2 to 0.80. The three items are found to be very 
easy with difficulty value above 0.80. And that three very 
easy test items were rejected for the final draft of 
achievement test. Hence it shows that if the difficulty index 
increases the difficulty of an item decreases. 

Item Discrimination and Frequency Graph 

Figure 2, the graph shows discrimination power of test items 
clearly and mostly 81% of the test items had a 
discrimination index of 0.40 and above and fall in the 
category of excellent items. Thirty seven items are 
considered as fair with a discriminative power range of 0.30 
to 0.39 while seventeen items with Discriminative power 
range between 0.20 to -o.2 needs improvement and are 
rejected. 

Figure 3 shows the relationship between difficulty level and 
discrimination power calculated for an achievement test 
for B.Ed., students. It shows a moderate negative 
relationship between values of Difficulty index and 
Discrimination power. This negative correlation signifies that 



Figure 2. Item Frequency for Each Discrimination 
Index Range. 


—♦—Difficulty level —1— Discrimination power 



Figure 3. Relationship between Difficulty Level and 
Discrimination Power. 

as the difficulty index increased discrimination index also 
increase but to an optimum value only after which 
discrimination power decrease with the increase in 
difficulty level. This suggested that the easier items (>0.80) 
or too difficult items (<0.20) poorly discriminate between 
the superior and inferior examinees. However in this study 
no wide scattering was observed, Thus, after the item 
analysis of test items nine items were recommended to 
reject for the final drafting of achievement test. 

Recommendations 

The study suggests the following recommendations for 
further studies: 

• Standardization is important to compare and contrast 
the student teachers perspectives regarding item 
analysis for conducting achievement test. 

• Investigating the impact of item analysis on 
developing student teachers at B.Ed., level for 
improving quality education. 

• It increases vigorous efforts to standardize 
achievement tests, which generally assess 
competency of student teachers. 

• It creates a climate to develop item analysis to cover 
wide range of skills and knowledge in selected paper 
for test. 

Conclusion 

The findings of this study imply the importance of the item 
analysis for defining the quality and efficacy of individual 
test item in constructing a more reliable test. The study 
proposes that test items with good positive discrimination 
power and medium difficulty are ideal for a good test 
whereas these items with negative or zero discrimination 
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power and having very low or very high difficulty level 
should be rejected. Item analysis results are tentative and 
are influenced by the quantity and nature of students, 
instructional techniques applied, chance errors and 
purpose of the test. This work can be repeated in other 
subjects to develop a good item bank for student 
community. 
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